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Background: 3CL pro protease is required for coronaviral polyprotein processing and is only active as a dimer. 

Results: MERS-CoV 3CL pro is a weakly associated dimer requiring ligand binding for dimer formation. 

Conclusion: Ligand-induced dimerization is a key mechanism for regulating the enzymatic activity of MERS-CoV 3CL pro 
during polyprotein processing. 

Significance: Activation via ligand-induced dimerization may add complexity for the development of MERS-CoV 3CL pro 
inhibitors as antivirals. 

V_ 


All coronaviruses, including the recently emerged Middle 
East respiratory syndrome coronavirus (MERS-CoV) from the 
fi-CoV subgroup, require the proteolytic activity of the nsp5 
protease (also known as 3C-like protease, 3CL pro ) during virus 
replication, making it a high value target for the development of 
anti-coronavirus therapeutics. Kinetic studies indicate that in 
contrast to 3CL pro from other |3-CoV 2c members, including 
HKU4 and HKU5, MERS-CoV 3CL pro is less efficient at pro¬ 
cessing a peptide substrate due to MERS-CoV 3CL pro being a 
weakly associated dimer. Conversely, HKU4, E1KU5, and SARS- 
CoV 3CL pro enzymes are tightly associated dimers. Analytical 
ultracentrifugation studies support that MERS-CoV 3CL pro is a 
weakly associated dimer ( K d ~52 /xm) with a slow off-rate. Pep- 
tidomimetic inhibitors of MERS-CoV 3CL pro were synthesized 
and utilized in analytical ultracentrifugation experiments and 
demonstrate that MERS-CoV 3CL pro undergoes significant 
ligand-induced dimerization. Kinetic studies also revealed that 
designed reversible inhibitors act as activators at a low com¬ 
pound concentration as a result of induced dimerization. Pri¬ 
mary sequence comparisons and x-ray structural analyses of two 
MERS-CoV 3CLpro and inhibitor complexes, determined to 1.6 
A, reveal remarkable structural similarity of the dimer interface 
with 3CL pro from HKU4-CoV and HKU5-CoV. Despite this 
structural similarity, substantial differences in the dimerization 
ability suggest that long range interactions by the nonconserved 
amino acids distant from the dimer interface may control 
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MERS-CoV 3CL pro dimerization. Activation of MERS-CoV 
3CL pro through ligand-induced dimerization appears to be 
unique within the genogroup 2c and may potentially increase 
the complexity in the development of MERS-CoV 3CL pro inhib¬ 
itors as antiviral agents. 


Coronaviruses (CoVs) 3 * are enveloped, positive-strand RNA 
viruses that infect a variety of vertebrates, including bats, live¬ 
stock, pets, poultry, and humans (1-3). Although human CoVs 
cause respiratory illnesses of mild to moderate severity (4-9), 
two recently emerged CoVs, severe acute respiratory syndrome 
coronavirus (SARS-CoV) and Middle East respiratory syn¬ 
drome coronavirus (MERS-CoV), have demonstrated their 
potential to become a serious threat to public health. MERS- 
CoV emerged late in 2012, and unlike its predecessor SARS- 
CoV, MERS-CoV continues to exhibit up to a 35% fatality rate 
(10-12). 

Based on the sequence analysis of seven genes of the replicase 
domain, MERS-CoV has been classified as a /J-CoV genogroup 
2c member, along with closely related bat coronaviruses EIKU5 
(Pipistrellus bat) and HKU4 (Tylonycteris bat) (13,14). Increas¬ 
ing evidence suggests that bats may serve as zoonotic reservoirs 
for MERS-CoV (15, 16). Evidence presented by recent studies 
also supports the local zoonotic transmission of MERS-CoV 
from dromedary camels to humans (17, 18). Alarmingly, 
human-to-human transmission during close contact, especially 
in elderly or patients with underlying health conditions, has 
also been reported for MERS-CoV (19-22). In the wake of the 
recent upsurge in the laboratory-confirmed cases of MERS- 
CoV, including two recently identified cases in the United 
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States (23), there is an urgent need to study and characterize the 
properties of important drug targets of MERS-CoV for the 
development of effective therapeutics. 

Coronaviruses express a >800-kDa replicase polyprotein, 
which is processed by viral 3CL pro protease (or nsp5) at 11 dis¬ 
tinct cleavage sites to yield intermediate and mature nonstruc- 
tural proteins (nsp) responsible for many aspects of virus repli¬ 
cation (3, 24-26). Because of its indispensable role in the virus 
life cycle, 3CL pro is an important target for therapeutic inter¬ 
vention against coronavirus infections (27-33). 

A number of kinetic, biophysical, and x-ray structural studies 
have demonstrated that SARS-CoV 3CL pro is only active in 
vitro as a tightly associated dimer with a dimer dissociation 
constant (I< d ) in the low nanomolar range (34-38). The addi¬ 
tion or deletion of amino acids, e.g. His 6 affinity tags, at either 
the N or C terminus drastically reduces the enzymatic rate and 
decreases the ability of SARS-CoV 3CL pro to dimerize (37). 
Although cellular evidence for the auto-cleavage mechanism 
(cis versus trans ) of 3CL pro is lacking, models for how 3CL pro 
cleaves itself from the polyprotein to form the mature dimer 
have been proposed based on in vitro studies using purified 
3CL pro (34, 39, 40). A current model posits that two inactive 
3CL pro molecules within two separate polyproteins recognize 
each other and form an immature dimer capable of cleaving the 
nsp4 | nsp5 and nsp5 | nsp6 sites in trans, followed by forma¬ 
tion of an active and mature dimer that can then rapidly process 
other cleavage sites and multiple polyproteins. It has also been 
proposed that substrate-induced dimerization regulates the 
enzymatic activity of SARS-CoV 3CL pro during virus replica¬ 
tion; however, no experimental evidence of this has ever been 
demonstrated in infected cells (40). Although our knowledge of 
SARS-CoV 3CL pro is extensive, the dimerization properties of 
3CL pro from MERS-CoV and other coronaviruses, as well as the 
factors regulating their enzymatic activity, remain largely 
unknown. 

To understand the properties of MERS-CoV 3CL pro , we con¬ 
ducted a series of kinetic, biophysical and x-ray structural stud¬ 
ies. Here, we report a detailed kinetic and biophysical analysis of 
MERS-CoV 3CL pro activity and dimerization. These kinetic 
and biophysical studies provide evidence for a weakly associ¬ 
ated MERS-CoV 3CL pro dimer. In addition, we utilized our pre¬ 
vious knowledge on the design of potent SARS-CoV 3CL pro 
peptidic inhibitors to design a series of inhibitors of MERS-CoV 
3CL pro that exhibit low micromolar potency. We demonstrate 
that MERS-CoV 3CL pro requires the binding of a ligand for 
dimer formation, indicating that ligand-induced dimerization 
is likely a key mechanism in the regulation of MERS-CoV 
3CL pro activity during virus infection. 

Experimental Procedures 

Construct Design and Expression of MERS-CoV 3CL pro —The 
gene encoding 3CL pro protease of MERS-CoV (amino acid res¬ 
idues 3248-3553 in the replicase polyprotein, GenBank™ 
accession number AHC74086.1) was codon-optimized for 
optimal expression in E. coli (BioBasic Inc). The gene was sub¬ 
cloned into pET-1 la expression vector with an N-terminal His 6 
tag followed by the nsp4 | nsp5 auto-cleavage site using the 
forward primer 5' - AT AT ACATAT GC ACC ACCACC AC- 


CACCACAGCGGTGTTCTGCAGTCTGGTC-3' and the 
reverse primer 5 '-GACGG AT CCTTACTGCAT CACAA- 
CACCCATGATCTGC-3'. The construct was verified by DNA 
sequencing at the Purdue University Genomics Core Facility. 
This construct results in the expression of MERS-CoV 3CL pro 
without any N- or C-terminal extensions. MERS-CoV 3CL pro 
was expressed through auto-induction in Escherichia coli 
BL21-DE3 cells in the presence of 100 qg/ml carbenicillin as 
described previously (41). Cells were harvested by centrifuga¬ 
tion at 5000 X gior 20 min at 4 °C, and the pellets were stored at 
— 80 °C until further use. 

MERS-CoV3CL pro Purification —Frozen pellets from 4 liters 
of bacterial cell culture were thawed on ice and resuspended in 
250 ml of Buffer A (20 mM Tris, pH 7.5, 0.05 mM EDTA, 10% 
glycerol, and 5 mM /3-mercaptoethanol (BME)), containing 500 
qg of lysozyme and a small amount of DNase. Cells were then 
lysed using a single pass through a French press at 1200 p.s.i., 
and cell debris was removed from the cleared lysate by centri¬ 
fuging at 29,000 X g for 30 min. Solid ammonium sulfate was 
added to the cleared lysate to a final concentration of 1 m 
through gradual mixing on ice. 

Hydrophobic Interaction Chromatography —The cleared 
lysate, mixed with ammonium sulfate, was loaded at a flow rate 
of 3 ml/min onto a 60-ml phenyl-Sepharose 6 fast-flow high- 
sub column (Amersham Biosciences) equilibrated with Buffer 
B (50 mM Tris, pH 7.5,1 m ammonium sulfate, 0.05 mM EDTA, 
10% glycerol, and 5 mM BME). The column was then washed 
with 5 X column volume (300 ml) of Buffer B at a flow rate of 4 
ml/min. Protein was eluted using a 5 X column volume (300 ml) 
linear gradient to 100% Buffer A. Fractions (12 ml) were col¬ 
lected, and those containing MERS-CoV 3CL pro , as judged 
through SDS-PAGE analysis and specific activity measure¬ 
ments, were pooled (120 ml) and exchanged into 2 liters of 
Buffer A via overnight dialysis in a 10,000 molecular weight 
cutoff SnakeSkin® dialysis tubing (Thermo Scientific). 

DEAE Anion-exchange Chromatography —Dialyzed sample 
from the previous step was loaded at a flow rate of 3 ml/min 
onto a 120- ml DEAE anion-exchange column (Amersham Bio¬ 
sciences) equilibrated with Buffer A. The column was then 
washed with 2X column volume (240 ml) of Buffer A at a flow 
rate of 4 ml/min. A linear gradient (total volume 480 ml) to 40% 
Buffer C (50 mM Tris, pH 7.5, 1 m NaCl, 0.05 mM EDTA, 10% 
glycerol, and 5 mM BME) was used to elute the protein. Frac¬ 
tions (6 ml) were collected, and those containing MERS-CoV 
3CL pro were pooled (66 ml) and dialyzed for 4 h in 4 liters of 
Buffer D (20 mM MES, pH 5.5, 0.05 mM EDTA, 10% glycerol, 
and 5 mM BME). 

Mono S Cation-exchange Chromatography —Following dial¬ 
ysis, the pH of the sample was manually adjusted to 5.5 using 1 
M solution of MES, pH 5.5, and any precipitated protein was 
removed by filtering through a 0.22 -pm pore size Millex-GP 
filter (Millipore). The filtered sample was then loaded at a flow 
rate of 2 ml/min onto an 8-ml Mono S 10/100 column (Amer¬ 
sham Biosciences) equilibrated in Buffer D. The column was 
then washed with 5 X column volume (40 ml) of Buffer D at a 
flow rate of 2 ml/min. Protein was eluted using a 25 X column 
volume (200 ml) and a linear gradient to 50% Buffer E (50 mM 
MES, pH 5.5,1 m NaCl, 0.05 mM EDTA, 10% glycerol, and 5 mM 
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BME). Fractions (2 ml) were collected, and those containing 
MERS-CoV 3CL pro were pooled (22 ml) and concentrated to 
~5 mg/ml. 

Gel Filtration Chromatography— As the final purification 
step, the concentrated protein sample was loaded onto the 
preparation grade Superdex 75 26/60 gel filtration column 
(Amersham Biosciences) equilibrated with Buffer F (25 mM 
FIEPES, pH 7.5, 10% glycerol, 2.5 mM dithiothreitol (DTT)). 
Protein was eluted isocratically at a flow rate of 1 ml/min with 
Buffer F. Fractions (2 ml) containing MERS-CoV 3CF pro were 
pooled (total volume of 34 ml) and concentrated to ~5 mg/ml. 
For final storage of the purified MERS-CoV 3CF pro enzyme, 
300-/a1 protein aliquots were placed into 1-ml screw-cap vials, 
flash-frozen under liquid nitrogen, and then stored at —80 °C 
until further use. 

Purification of SARS-CoV, HKU4-CoV, and HKUS-CoV 
3CL pro - SARS-CoV 3CF pro and HI<U5-CoV 3CF pro with 
authentic N and C termini were expressed and purified as 
described previously (37, 42). HI<U4-CoV 3CF pro was purified 
utilizing a modified protocol from Ref. 42. Final protein yield 
was calculated based on the measurement of total activity units 
(/am product/min), specific activity (units/mg), and milligrams 
of protein obtained (Bio-Rad protein assay) after each chro¬ 
matographic step. 

Synthesis of Compounds 1-11 —The peptidomimetic com¬ 
pounds with Michael acceptor groups (compounds 1 - 9 , Table 
3) were synthesized via very similar methods to those published 
previously (30, 43). Synthesis of noncovalent peptidomimetic 
compounds 10 and 11 (Table 3) has been described previously 
(33). 

Fluorescence-based Kinetic Assays—The enzymatic activity 
of 3CF pro was measured using the following custom-synthe¬ 
sized peptide: HilyteFluor™-488-ESATLQSGFRKAK-(QXF™- 
520)-NH 2 (AnaSpec, Inc.). The HilyteFluor™-488 fluo¬ 
rescence group was internally quenched by QXL™-520 dye. 
This substrate works as a generic peptide substrate for 3CF pro 
enzymes and was designed based on the nsp4 | nsp5 cleavage 
sequence for many coronavirus 3CF pro enzymes. The rate of 
enzymatic activity was determined at 25 °C by following the 
increase in fluorescence (A excitation = 485 nm, A emission = 528 
nm, bandwidths = 20 nm) of Hilyte Fluor™-488 upon peptide 
hydrolysis by the enzyme as a function of time. Assays were 
conducted in black, half-area, 96-well plates (Corning Glass) in 
assay buffer (50 mM HEPES, pH 7.5, 0.1 mg/ml BSA, 0.01% 
Triton X-100, and 2 mM DTT) using a final reaction volume of 
100 /a1. The resulting florescence was monitored using a BioTelc 
Synergy HI plate reader. The rate of the reaction in arbitrary 
fluorescence units/s (AFU/s) was determined by measuring the 
initial slope of the progress curves, which were then converted 
to units of micromolars of product produced per min (/AM/min) 
using experimentally determined values of fluorescence 
“extinction coefficient” as described previously (37). All reac¬ 
tions were carried out in triplicate. 

Determination of Enzymatic Efficiency— The apparent enzy¬ 
matic efficiency for each of the 3CF pro enzymes was determined 
by measuring the rate of enzymatic activity as a function of 
varying substrate concentration in 100-/a1 reactions. Reactions 
were initiated by the addition of enzyme to the wells of an assay 


plate containing varying concentrations of substrate. The final 
substrate concentrations varied over a range from 0 to 2 /am. 
The final enzyme concentrations for each 3CF pro studied were 
as follows: MERS-CoV 3CF pro at 1 /am, SARS-CoV 3CF pro at 
100 nM, HI<U5-CoV 3CF pro at 250 nM, and HI<U4-CoV 3CF pro 
at 200 nM. Because 3CF pro enzymes cannot be saturated with 
this substrate at a substrate concentration that would still allow 
accurate fluorescent measurements without the inner filter 
effect, only the apparent k cat IK m values can be determined from 
the slope of the line that results from a plot of the enzymatic 
activity (y axis), normalized for the total enzyme concentration, 
against the substrate concentration (x axis). 

Influence of Dimerization on the Activity of3CL pro Enzymes — 
The dependence of the enzymatic activity on the total enzyme 
concentration was determined using the FRET-based assay 
described above. The final enzyme concentrations were varied 
over a concentration range from 2 /am to 100 nM for MERS-CoV 
3CF pro , 500 to 10 nM for SARS-CoV 3CF pro , 250 to 0.6 nM for 
HI<U5-CoV 3CF pro , and 200 to 10 nM for HI<U4-CoV 3CF pro . 
Reactions were initiated by the addition of substrate, at a final 
concentration of 2 /am, to the assay plates containing varying 
enzyme concentrations in the assay buffer. Initial rates were 
determined from the initial slopes of the progress curves at each 
enzyme concentration. 

The rates of the 3CF pro -catalyzed reactions measured over a 
range of enzyme concentrations can be fit to either Equation 1 
or 2 to determine the values of the dissociation constant for the 
monomer-dimer equilibrium as well as the turnover numbers. 
Nonlinear regression and the program TableCurve 2D version 
4.0 were used to fit the data to either Equation 1 or 2 below (44). 

— K d + ^IKf + 8 K d C T 

^max . 


, K d + 4 C t — + 8K d C T 

Acat,D-g- (Eq. 1) 


In Equation 1, V max is the rate of the enzymatic activity calcu¬ 
lated at each enzyme concentration (C r ); K d is the monomer- 
dimer equilibrium dissociation constant, and k catiM and A cat D 
are the turnover numbers for the monomer and the dimer, 
respectively. 


1/max = k c JD] = k c 


K d + 4C T - ^Kf + 8 K d C T 
" 8 


(Eq. 2) 


In Equation 2, V max , C T , and K d have been described previously, 
and A cat is the turnover number for the dimer only. 

Inhibition Assays —To determine the percent inhibition for 
compounds 1 - 9 , the total concentration of the substrate was 
fixed at 1.0 /am, and the enzymes were fixed at 250 nM for SARS- 
CoV 3CL pro , HKU5-CoV 3CL pro , HKU4-CoV 3CL pro , and at 
500 nM for MERS-CoV 3CL pro . DMSO stocks (100 X) of the 
compounds were diluted a hundred-fold to a final concentra¬ 
tion of 50 /am in 80 /a1 of the enzyme solution and incubated for 
20 min. After 20 min, the enzymatic activity was measured as 
initial slope of the progress curve, obtained by initiating the 
reaction with 20 /a1 of 5 /am substrate. % inhibition was calcu¬ 
lated using Equation 3. 


AUGUST 7, 2015-VOLUME 290-NUMBER 32 J^SBMB 


JOURNAL OF BIOLOGICAL CHEMISTRY 19405 


Downloaded from http://www.jbc.org/ at University of Pittsburgh on August 10, 2015 








Ligand-induced Dimerization Regulates MERS-CoV3CL pro 


% inhibition 


(r3t6 sam p| e rate ne g)\ 

—---f- X 100 

(rate pos ~ rate neg ) / 

(Eq.3) 


In Equation 3, rate sample is the initial slope of the progress curve 
in AFU/s measured in the presence of the compound; rate pos is 
the initial slope measured in the absence of any compound, and 
rate neg is the baseline substrate hydrolysis calculated in the 
absence of enzyme. All the reactions were carried out in tripli¬ 
cate and contained a final DMSO concentration of 1%. For 
compounds displaying more than 50% inhibition, a more exten¬ 
sive characterization of the inactivation kinetics was performed 
through progress curve analysis. To the reaction well, 20 /a1 of 5 
/am substrate was added to a final concentration of 1 /am, and 
the total inhibitor concentration [7] total was varied from 0 to 50 
/AM. The reaction was initiated with the addition of 80 pi of 
MERS-CoV 3CF pro to a final concentration of 500 nM. Fluores¬ 
cence intensity was then measured over time as AFU t for a 
period of 70 min. Equation 4 describes the resulting time course 
of reaction. 


[ P]t = 7T-0 - exp(-/c obs f) + [P], (Eq.4) 

''obs 

In Equation 4, v, is the initial velocity of the reaction; k obs is the 
observed first-order rate constant for the reaction in the 
absence and presence of inhibitor; t is the time in minutes; [P] t 
is the concentration of product produced at time t, and [P], is 
the initial product concentration, which is zero. Product con¬ 
centrations were calculated from the values of AFU 4 , using the 
experimentally determined fluorescence extinction coefficient. 
The resulting values of [P] t were then plotted against time t, and 
the data were fit to Equation 4 with [P] l = 0 using the nonlinear 
regression program TableCurve 2D to derive the fitted param¬ 
eters v i and /c obs and their associated errors Av,- and A/c obs . 

Values for each k obs were then plotted against [7] total and the 
data were fit to Equation 5. 




^inactWtotal 
Kl + [/]total 


(Eq. 5) 


In Equation 5, /q nact defines the maximum rate of inactivation at 
infinite inhibitor concentration, and K f defines the concentra¬ 
tion of inhibitor that yields a rate of inactivation equal to 
y 2 k inact . The half-life of inactivation at infinite inhibitor concen¬ 
tration, which is a measure of inactivation efficiency, is defined 
as tv, = 0.693/k inact . 

AUC Analysis —To determine the oligomeric state of MERS- 
CoV 3CL pro , sedimentation velocity experiments were per¬ 
formed at 20 °C on the Beckman-Coulter XLA ultracentrifuge 
using varying concentrations of MERS-CoV 3CL pro (4 -23 /am) 
in 25 mM HEPES, pH 7.5, 50 mM NaCl, and 1 mM tris(2-car- 
boxyethyl)phosphine at 50,000 rpm. To characterize the effect 
of the ligand on the monomer-dimer equilibrium of MERS- 
CoV 3CL pro , sedimentation velocity experiments were con¬ 
ducted on the Beckman-Coulter XLI instrument using different 
stoichiometric ratios of MERS-CoV 3CL pro with compounds 6 
and 10 . Samples were prepared by mixing 25 /am MERS-CoV 


3CL pro with 25, 50, and 100 /am compound 6 or 10 and incu¬ 
bating the mixture overnight at 4 °C before performing the 
experiments. Absorbance optics (280 nm) and interference 
optics were utilized for protein detection. Solvent density, vis¬ 
cosity, and partial specific volumes were calculated using 
SEDNTERP. SEDPHAT was used to fit the data to the mono¬ 
mer-dimer self-association model to estimate the sedimenta¬ 
tion coefficients (s), apparent molecular weights, and I< d and 
k o{£ values from size distribution analysis. To obtain exact 
molecular weights, sedimentation equilibrium experiments 
were performed at concentrations of 3 and 17 /am MERS-CoV 
3CL pro . The experiments were done at 20 °C utilizing a two- 
channel centerpiece and run at multiple speeds (8100, 13,800 
and 24,000 rpm) in a AN-60 Ti rotor. 

MERS-CoV 3CL pro Activation and Inhibition by a Noncova- 
lent Inhibitoi —The rates of the MERS-CoV 3CL pro -catalyzed 
reactions were determined at final enzyme concentrations of 
0.5,1.0, and 2.0 /am and in the absence and presence of varying 
concentrations (0.1-60 /am) of compound 10 . The substrate 
concentration was fixed at 2.0 /am. DMSO stocks (100 X) of 
compound 10 were diluted a hundred-fold in 80 /a1 of enzyme 
solution and incubated for 10 min. At the same time, a zero- 
inhibitor control reaction was set up by mixing DMSO to a final 
concentration of 1% into 80 /a1 of enzyme solution. After 10 
min, the rate of the enzymatic activity was measured as the 
initial slope of the progress curve, obtained by initiating the 
reaction with 20 /a1 of 10 /am substrate. Equation 6 was utilized 
to calculate the percent activity. 


% activity = 


(rate sam p| e rat6 neg ) 
(rate pos - rate neg ) 


X 100 


(Eq. 6) 


The rate sample , rate pos , and rate neg are as described above for 
Equation 3. 

MERS-CoV 3CL pro Crystallization, X-ray Data Collection, 
and Structure Determination —Purified MERS-CoV 3CL pro 
was concentrated to 1.6 mg/ml in 25 mM HEPES, pH 7.5, and 
2.5 mM DTT. Inhibitor complexes of MERS-CoV 3CL pro with 
compounds 6 and 11 were formed by incubating MERS-CoV 
3CL pro with the compounds in a 1:3 stoichiometric ratio at 4 °C 
overnight. After iterative rounds of optimization of the crystal¬ 
lization conditions based on the initial hits obtained from high 
throughput screening of Qiagen Nextel Screens, crystals of 
MERS-CoV 3CL pro inhibitor complexes suitable for x-ray dif¬ 
fraction were grown by the hanging-drop, vapor diffusion 
method at 20 °C in 0.2 m sodium acetate, 0.1 m BisTris, pH 7.0, 
and 20% PEG-3350 for the MERS-CoV 3CL pro and 6 complex, 
and 0.2 m ammonium acetate, 0.1 m BisTris, pH 5.5,12% PEG- 
3350 for the MERS-CoV 3CL pro and 11 complex. For x-ray data 
collection, crystals were flash-cooled in liquid nitrogen after 
dragging the crystals through a cryo-solution that con¬ 
tained the crystallization solution supplemented with 15% 
2-methyl-2,4-pentanediol. 

X-ray diffraction data were collected for MERS-CoV 3CL pro 
and 6 and MERS-CoV 3CL pro and 11 complexes at the Lilly 
Research Laboratories Collaborative Access Team (LRL-CAT) 
Sector 31 and the Life Sciences Collaborative Access Team (LS- 
CAT) Sector 21 at the Advanced Photon Source, Argonne 
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TABLE 1 

Purification summary of MERS-CoV 3CL pr ° per liter of E.coli BL21-DE3 cells 


Sample 

Protein 

Total activity units 

Specific activity 

Fold purification 

% yield 


mg 


units/mg 



Lysate 

1102 

1168 

1 

1 

100 

Phenyl-Sepharose 

219 

185 

1 

1 

16 

DEAE 

22 

189 

8 

8 

16 

Mono S 

15 

142 

9 

9 

12 

Superdex 75 

12 

114 

10 

10 

10 


A. 



B. 


3CL pro Enzyme 

7WK m (» 10 2 ) (pM'Lmin' 1 ) 

MERS-CoV 

3.1 ±0.03 

SARS-CoV 

15.5 ±0.9 

HKU5-CoV 

8.8 ±0.1 

HKU4-CoV 

11.3 ±0.4 


FIGURE 1. Comparison of enzymatic efficiencies ( k cat IK m ) of 3CL pro enzymes from different CoVs. A, rates for the enzymatic activity, normalized to the total 
enzyme concentration, are plotted as a function of varying substrate concentrations. Total concentration of each enzyme in the final reaction is as follows: 
MERS-CoV 3CL pro at 1 p.M; SARS-CoV 3CL pro at 100 nM; HKU5-CoV 3CL pro at 250 nM; and HKU4-CoV 3CL pro at 200 nM. Slope of the line represents the apparent 
value of k cat /K m . Error bars represent the standard deviation for triplicate data. 6, *, apparent value of k cat /K m for the nonsaturable substrate, calculated as the 
slope of the linear plot from panel A. 


National Laboratory, respectively. Data were processed and 
scaled using Mosflm version 7.0.5 (45) and HKL2000 version 
706 (46). The method of molecular replacement was used to 
obtain initial phases using the program PHASER-MR in Phenix 
suite version 1.8.4 (47). For MERS-CoV 3CL pro and 6 complex, 
the x-ray structure of SARS-CoV 3CL pro (PDB code 3V3M) was 
used as a phasing model (32). The final MERS-CoV 3CL pro and 
6 complex structure was then used to calculate the initial 
phases for the MERS-CoV 3CL pro and 11 complex model. 
Automated model building using Autobuild in Phenix was ini¬ 
tially used to build a preliminary model of the MERS-CoV 
3CL pro and 6 inhibitor complex. Each structure was then 
refined using iterative cycles of refinement using Phenix Refine 
coupled to manual model building using COOT (48) based on 
F 0 — F c and 2 F 0 — F c maps. Coordinates and molecular library 
files for inhibitor molecules were built using the program 
eLBOW in the Phenix suite. Water molecules were added to 
peaks in residual (F n — F c ) density maps that were greater than 
3cr using the “Find Water” function in COOT. MolProbity was 
used to assess structural quality of the final model (49). The 
measured structure factor amplitudes and the atomic coordi¬ 
nates for the final structures were deposited in the Protein Data 
Bank with accession codes 4RSP (MERS-CoV 3CL pro and 6 
complex) and 4YLU (MERS-CoV 3CL pro and 11 complex), 
respectively. Structural superposition was performed using the 
method of least squares fitting of C-a atoms in COOT. PyMOL 
was used to generate figures of all the structures (50). 

Results 

Production of MERS-CoV 3CL pro with Authentic N and C 
Termini— Insertion of the nsp4 | nsp5 cleavage site between 
the N-terminal His 6 tag and the coding region for MERS-CoV 
3CL pro results in autoprocessing of the His tag and overexpres¬ 


sion of MERS-CoV 3CL pro without any N-terminal extension in 
E. coli BL21-DE3 cells. MERS-CoV 3CL pro was purified to high 
purity and an overall yield of 10% using four sequential chro¬ 
matographic steps. A summary of the percent enzyme yield, 
total activity units, and the fold-purification after each chro¬ 
matographic step is summarized in Table 1. Approximately 12 
mg of highly pure MERS-CoV 3CL pro can be obtained per liter 
of bacterial cell culture. 

To verify the production of the enzyme with correct N and C 
termini, the molecular mass of purified MERS-CoV 3CL pro was 
determined by MALDI to be 33.4 kDa, which is close to the 
theoretical molecular mass of 33.3 kDa for the authentic/ma¬ 
ture MERS-CoV 3CL pro monomer. Western blot analysis of 
purified MERS-CoV 3CL pro using an anti-His 6 antibody also 
confirmed the absence of the N terminus His 6 tag associated 
with the expression plasmid (data not shown). These results 
demonstrate that the N-terminal His 6 tag is auto-catalytically 
removed by MERS-CoV 3CL pro during its expression in E. coli, 
indicating MERS-CoV 3CL pro is enzymatically active when 
expressed in E. coli. 

MERS-CoV 3CL pro Elydrolyzes a Fluorescent Peptide Sub¬ 
strate with Lower Efficiency than Other 3CL pro Enzymes —A 
FRET-based peptide substrate was used to measure the enzy¬ 
matic activity of MERS-CoV 3CL pro as a function of substrate 
concentration over a substrate concentration range from 0 to 
2.0 /jlm (Fig. L4). We observed that MERS-CoV 3CL pro cannot 
be saturated by the substrate over this concentration range, 
which is typical for other coronavirus 3CL pro enzymes because 
the K m values for peptide substrates approach 1 mM (51-54). 
Therefore, the slope of the kinetic response of MERS-CoV 
3CL pro to increasing substrate concentration was determined 
to derive an apparent (k cat //<C m ) value, which is a measure of 
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FIGURE 2. Dependence of the enzymatic activity of MERS-CoV, HKU4-CoV, 
HKU5-CoV, and SARS-CoV 3CL pro on the total enzyme concentration. A, 

kinetic response of each CoV 3CL pro to increasing enzyme concentration is plot¬ 
ted along with the resulting fit of the data to Equation 2. Resulting values for the 
apparent turnover number, k cat , and the monomer-dimer equilibrium constant, 
K d , are shown in Table 2. Final enzyme concentrations varied over the concentra¬ 
tion ranges of 2 /jlm to 100 nM for MERS-CoV 3CL pro , 500 to 10 nM for SARS-CoV 
3CL pro , 250 to 0.6 nM for HKU5-CoV 3CL pro , and 200 to 10 nM for HKU4-CoV 3CL pro . 
Final substrate concentration was fixed at 2 /jlm. Experiments were done in tripli¬ 
cate. Error bars represent the standard deviation for triplicate data. Shaded box 
represents the data that are plotted in 6.8, enlarged view of the fitted data at low 
total enzyme concentrations, marked in shaded box in A, illustrating the nonlinear 
dependence of enzymatic activity on the total concentrations of 3CL pro from 
SARS-CoV, HKU5-CoV, and HKU4-CoV. 

enzymatic efficiency. We also determined and compared the 
apparent (k cd JK m ) values for 3CL pro enzymes from SARS-CoV, 
HI<U5-CoV, and HI<U4-CoV under similar experimental con¬ 
ditions (Fig. LB). MERS-CoV 3CL pro is able to hydrolyze the 
peptide substrate; however, the enzymatic efficiency of MERS- 
CoV 3CL pro (k c JK m = 3.1 ± 0.03 X 10“ 2 pjvT 1 min -1 ) is 
noticeably lower than other 3CL pro enzymes tested. Specifi¬ 
cally, MERS-CoV 3CL pro was 5-fold less efficient at processing 
the peptide substrate when compared with SARS-CoV 3CL pro . 
Even among the j3-CoVs from the same 2c genogroup (MERS, 
HKU5, and HKU4), MERS-CoV 3CL pro was the least efficient 
enzyme. 

MERS-CoV3CL pro Is a Weakly Associated Dimer —Because a 
dimer has consistently been shown to be the catalytically active 
form of all 3CL pro enzymes studied to date, we tested the 
hypothesis that the lower enzymatic efficiency of MERS-CoV 
3CL pro is a result of the reduction in its ability to dimerize. 
Therefore, we determined the dependence of the enzymatic 
activity of MERS-CoV 3CL pro on the total enzyme concentra¬ 
tion and compared it with other 3CL pro enzymes from HKU4, 
HI<U5, and SARS coronaviruses (Fig. 2). 


TABLE 2 

Comparison of the apparent turnover number, k cat , and the monomer- 
dimer dissociation constant, K d , for 3CL pro from different CoVs 


3CL pro 

Nonlinear fitting of kinetic data* 

k b 

'cat 

K d 


min* 1 

/JLM 

MERS-CoV 

0.2 ± 0.02 

7.8 ± 1.3 

SARS-CoV 

0.47 ± 0.03 

0.06 ± 0.01 

HI<U5-CoV 

0.53 ± 0.02 

0.06 ± 0.01 

HKU4-CoV 

0.84 ± 0.07 

0.1 ± 0.03 


a Values were determined through nonlinear fitting of the kinetic data to Equation 
2 . 

b k cat represents the apparent turnover number. 


It is immediately apparent from the data plotted in Fig. 2 that 
the response of MERS-CoV 3CL pro enzymatic activity to an 
increasing enzyme concentration is nonlinear. The strong cur¬ 
vature suggests that a dimer is either the most active form or the 
only active form of MERS-CoV 3CL pro . To determine the 
mechanism of dimerization, the data in Fig. 2 were first fit to 
Equation 1 (see “Experimental Procedures”), which describes a 
model where both the monomer and the dimer are active. A fit 
of the data to Equation 1 yielded a negative turnover value for 
the monomer (k cattM ), suggesting the monomer is inactive and 
that the dimer is the only active form of the enzyme. Therefore, 
the data were fit to Equation 2 (see “Experimental Procedures”), 
which considers only the dimer as the active form of the 
enzyme. The kinetic data for all four 3CL pro enzymes, MERS- 
CoV, HKU4-CoV, HKU5-CoV, and SARS-CoV, fit well to this 
model, and the resulting values for the monomer-dimer equi¬ 
librium dissociation constant, K d , and apparent turnover num¬ 
ber, /c cat , for each enzyme are provided in Table 2. 

The lower Ar cat value for MERS-CoV 3CL pro , when compared 
with other coronavirus 3CL pro enzymes, indicates a moderate 
reduction (2-4-fold) in its ability to turn over the substrate, 
which is consistent with the observed lower apparent (, k cat IK m ) 
value. In contrast, there is a substantial reduction in the ability 
of MERS-CoV 3CL pro to dimerize compared with the other 
3CL pro enzymes. Based on the K d values, the capacity of MERS- 
CoV 3CL pro to dimerize is ~78 -130-fold weaker than the other 
enzymes (Table 2). These results indicate that the MERS-CoV 
3CL pro dimer is much more weakly associated than the other 
coronavirus 3CL pro enzymes studied, and these results raise 
questions as to the structural and mechanistic differences 
among the 3CL pro enzymes that ultimately regulate protease 
activity during coronavirus replication. 

MERS-CoV 3CL pro Inhibition by Designed Peptidomimetic 
Compounds— In an effort to develop potent inhibitors of 
MERS-CoV 3CL pro , we designed and synthesized nine peptido¬ 
mimetic compounds containing a Michael acceptor group, i.e. 
an a,j3-unsaturated carbonyl, capable of irreversibly reacting 
with the active site cysteine of MERS-CoV 3CL pro (Table 3). 
These compounds were designed and synthesized based on our 
understanding and knowledge of the interactions of similar 
inhibitor molecules with SARS-CoV 3CL pro (30, 31). At a con¬ 
centration of 50 pM, compounds 6-9 displayed more than 50% 
inhibition of MERS-CoV 3CL pro and were further evaluated 
for their ability to inactivate the enzyme in a time- and 
concentration-dependent manner (Fig. 3). Data from the 
kinetic progress curve for compound 6 (Fig. 3), as well as for 
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TABLE 3 

Chemical structures and inhibitory activity of compounds 1 to 11 against MERS-CoV 3CL pro 

The Michael acceptor group for compound 1 is shaded to highlight this group for all the compounds. The stereochemistry at the benzyl stereocenter of compound 5 is a 1:1 
mixture of enantiomers (racemic); therefore, the compound was tested as a mixture of diastereomers. 



* % inhibition was measured as the % loss in enzymatic activity after 20 min of incubation of 500 nM MERS-CoV 3CL pro with 50 p-M of the compound. 
a As compounds 1-5 showed <50% inhibition of MERS-CoV 3CL pro , values of £ 1 / 2 °° and Kj were not determined (nd) for these compounds. 

^inact is X10 -3 S -1 . 
c hi 2 ° is X 10 3 s. 
d Kj is in jUM. 

e IC 50 values for compounds 10 and 11 were calculated from a dose- response curve determined after 10 min of incubation of 1 /am MERS-CoV 3CL pro with varying concen¬ 
trations of compounds. IC 50 is in /am. 
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compounds 7-9 (data not shown), were fit to the appropriate 
equations (see under “Experimental Procedures”) to obtain the 
kinetic parameters, /c inact , tv*, and K p and the resulting values are 
provided in Table 3. 

We identified four compounds, 6 -9, as micromolar inhibi¬ 
tors of MERS-CoV 3CL pro with /^values less than 10 /am (Table 
3). Analysis of structure-activity relationships of these com¬ 
pounds suggests that the S 2 subsite pocket of MERS-CoV 
3CL pro is small and can only accommodate a smaller P’ 2 -isobu- 
tyl substituent (compounds 6-9) but not bigger substituents 
such as ^-benzyl or P 2 -isobutylenyl (compounds 1 - 5 ). It was 
also observed that replacing the If,-ethoxy (compound 6) with 
.P 4 -isopropoxy (compounds 7 and 8) had no effect on the inhib¬ 
itory activity of the compounds. Finally, these compounds pro¬ 
vide an excellent chemical scaffold to study the molecular 
details of interactions of substrate-like compounds with the 
enzyme and to develop more potent inhibitors of MERS-CoV 
3CL pro for therapeutic intervention. 

To evaluate broad spectrum specificity of these compounds, 
we also calculated % inhibition of SARS-CoV 3CL pro , HKU5- 
CoV 3CL pro , and HI<U4-CoV 3CL pro after 20 min of incubation 
in the presence of 50 /am compounds 6-9. Except for com¬ 
pound 9, which inhibited SARS-CoV 3CL pro by 76%, we 
observed 100% inhibition of all other enzymes in the presence 
of compounds 6-9. Furthermore, we performed progress curve 


[compound 6] pM 



Time (sec) 


FIGURE 3. Progress curves for the MERS-CoV 3CL pro -catalyzed reaction in 
the presence of compound 6. Time-dependent hydrolysis of 1 /am substrate 
catalyzed by 500 nM MERS-CoV 3CL pro was measured over a time period of 70 
min and at fixed variable concentrations of compound 6 ranging from 0 to 50 
/am. Values for the inactivation kinetic parameters /t inact , tva, and K, were cal¬ 
culated by fitting the progress curve data to Equations 4 and 5. Chemical 
structure of compound 6 is shown in the inset. 


analysis of HI<U5-CoV 3CL pro and HKU4-CoV 3CL pro in the 
presence of varying concentrations of compounds 6 -9. The K r 
values of compounds 6-9 for HI<U5-CoV 3CL pro are 0.49 ± 
0.16,0.60 ± 0.21,1.30 ± 0.53, and 0.47 ± 0.06 /am, respectively. 
The Kj values of compounds 6 -9 for HKU4-CoV 3CL pro are 
0.39 ± 0.14, 0.50 ± 0.17, 0.85 ± 0.33, and 0.64 ± 0.25 /am, 
respectively. These data suggest that peptidomimetic com¬ 
pounds 6 -9 have the potential to be developed as coronavirus 
3CL pro inhibitors with broad spectrum specificity. 

Weak Association of the MERS-CoV 3CL pro Dimer Is Sup¬ 
ported by AUC Studies— To further explore the mechanism of 
MERS-CoV 3CL pro dimerization, we performed analytical 
ultracentrifugation sedimentation velocity (AUC-SV) studies 
at varying concentrations of MERS-CoV 3CL pro (Fig. 4A). 
Unlike enzyme kinetics, AUC allows determination of the 
monomer-dimer equilibrium constant ( K d ) in the absence of 
substrate. MERS-CoV 3CL pro displayed a continuous size dis¬ 
tribution at different protein concentrations. Two distinct 
peaks corresponding to monomer (2.9 S) and dimer (3.9 S) spe¬ 
cies are observed, with the dimer peak becoming more pro¬ 
nounced at higher enzyme concentrations (Fig. 4A). We fit the 
AUC data to a monomer-dimer equilibrium model to deter¬ 
mine the values for K d and /c off , where K d is the equilibrium 
dissociation constant for a monomer from the dimer, and /c off is 
the rate constant for dissociation of the monomer from the 
dimer. The resulting best fit value for K d is 52 ± 5 /am and that 
for k of{ is 10 -4 s _1 . The K d value of 52 /am for MERS 3CL pro is 
dramatically different from SARS-CoV 3CL pro , which has 
reported I< d values ranging from low nanomolar up to 10 /am 
depending on the enzyme construct used and the experimental 
conditions and methods utilized to determine the dissociation 
constant (37). The dimer affinity of MERS-CoV 3CL pro is sub¬ 
stantially weaker than that for SARS-CoV 3CL pro , when com¬ 
paring the same enzyme construct, i.e. the enzyme without any 
N- or C-terminal modifications. The AUC-SV calculated K d 
value for MERS-CoV 3CL pro is ~150,000 times higher than the 
value of 0.35 nM determined for SARS-CoV 3CL pro (34). 

The AUC results (Fig. 4A) show that the monomer peak at 
-2.9S does not gradually shift peak position toward the dimer 
peak at -3.9S with increasing concentrations of MERS-CoV 
3CL pro ; rather, the two peaks change in area, which is indicative 


A [MERS-CoV 3CL pr0 ] 

— 23 pM 



B. - MERS-CoV 3CL P ": Cmpd 6(1:1) 
- MERS-CoV 3CL pr0 : Cmpd 6 (1:2) 
MERS-CoV 3CL p ”: Cmpd 6 1:4 



C. - MERS-CoV 3CL p ”: Cmpd <0(1:1) 
- MERS-CoV 3CL P '°: Cmpd 10 1:2 
MERS-CoV 3CU»°: Cmpd 10 1:4 



sedimentation coefficient (S) 


sedimentation coefficient (S) 


sedimentation coefficient (S) 


FIGURE 4. AUC-SV analyses of ligand-induced dimerization of MERS-CoV 3CL pro . A, sedimentation coefficient distribution for varying concentrations of 
MERS-CoV 3CL pro (4.1 to 23 /am) with sedimentation coefficient values of 2.9S and 3.9S for the monomer and the dimer, respectively. The best fit value for 
AUC-SV-calculated K d is 52 ± 5 /am. 6, sedimentation coefficient distribution of MERS-CoV 3CL pro (25 /am) in the presence of different stoichiometric ratios of 
compound 6 (25, 50, and 100 /am). C, sedimentation coefficient distribution of MERS-CoV 3CL pro (25 /am) in the presence of different stoichiometric ratios of 
compound 10(25,50, and 100 /am). A significant shift in the2.9S peak (monomer) to a 4.1 S peak (dimer) is detected upon addition of increasing concentrations 
of compounds 6 and 10. 
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of very slow monomer-dimer exchange rate (/c off ~10 -4 s _1 ) 
and the formation of hydrodynamically stable monomer and 
dimer species (55). This /c off value is 1000 times slower than the 
k of[ value (10 -1 s _1 ) reported for SARS-CoV 3CL pro indicating 
that the SARS-CoVenzyme has a significantly more rapid mono¬ 
mer-dimer exchange rate (56). These observations support a 
model whereby the MERS-CoV 3CL pro dimer is weakly associ¬ 
ated, suggesting the enzyme exists mainly as a monomer in 
solution. 

MERS-CoV 3CL pro Undergoes Extensive Ligand-induced 
Dimerization —The weak association of MERS-CoV 3CL pro 
monomers engenders the following questions. “Are higher lev¬ 
els of expression of 3CL pro in MERS-CoV-infected cells neces¬ 
sary to allow formation of active dimer?” “Are other mecha¬ 
nisms such as substrate- or ligand-induced dimerizations 
involved in activating 3CL pro ?” To explore the latter question of 
ligand-induced dimerization of MERS-CoV 3CL pro , we per¬ 
formed AUC experiments in the presence of compound 6, 
which acts as a substrate mimetic and mechanism-based inhib¬ 
itor, also known as a suicide substrate. Peptidomimetic com¬ 
pounds such as compound 6, which contains a Michael accep¬ 
tor group, interact and react with the active site cysteine of 
cysteine proteases to covalently modify them. We utilized com¬ 
pound 6 to form a covalent MERS-CoV 3CL pro and inhibitor 6 
complex that is stable over long periods of time, making it ame¬ 
nable to analysis by AUC-SV experiments. In contrast, incuba¬ 
tion of a normal peptide substrate with the enzyme would lead 
to immediate hydrolysis of the substrate and dissociation of the 
products from the enzyme, confounding AUC experiments and 
subsequent data analysis. 

MERS-CoV 3CL pro was incubated with varying concentra¬ 
tions of compound 6 in stoichiometric ratios of 1:1,1:2, and 1:4. 
The modified enzyme was then subjected to AUC studies to 
determine the influence of compound 6 on the mono¬ 
mer-dimer equilibrium (Fig. 4 B). A significant shift in the area 
under 2.9S peak (monomer) to 4.1S peak (dimer) is detected 
upon addition of increasing concentrations of compound 6. We 
obtained similar results when AUC studies were performed uti¬ 
lizing a complex of MERS-CoV 3CL pro with a noncovalent pep¬ 
tidomimetic inhibitor (compound 10 , Figs. 4C). The transition 
of MERS-CoV 3CL pro from monomer to dimer in the presence 
of compounds 6 and 10 suggests that the enzyme undergoes 
extensive dimerization upon substrate binding. 

MERS-CoV 3CL pro Is Activated by Ligand-induced Dim¬ 
erization— The observed ligand-induced dimerization of 
MERS-CoV 3CL pro , as demonstrated through AUC studies, 
prompted us to investigate whether or not the enzymatic activ¬ 
ity of MERS-CoV 3CL pro could be increased at low concentra¬ 
tions of a compound via ligand-induced dimerization. To do so, 
we chose to use a noncovalent peptidomimetic compound 
(compound 10 , Fig. 5 A) that we previously identified as an 
inhibitor of SARS-CoV 3CL pro . Because of the time-dependent, 
irreversible nature of the reaction between compound 6 and 
MERS-CoV 3CL pro , use of compound 6 was not ideal for these 
kinetic studies as it would further complicate kinetic data 
analysis. 

The kinetic response of MERS-CoV 3CL pro to increasing 
concentrations of compound 10 was first measured at a single 


enzyme concentration of 1.0 pM (Fig. 5 A). Interestingly, an 
increase in the activity of MERS-CoV 3CL pro , as high as 195%, 
was observed in the presence of low inhibitor concentrations 
(0.1 to 20 pM). Inhibition of enzymatic activity was observed 
only at higher inhibitor concentrations (40 pM or greater). 
These results suggest that at low concentrations, compound 10 
binds to a monomer and induces the formation of a dimer. The 
resulting dimer then has one free active site that is capable of 
processing the substrate. At higher concentrations of inhibitor, 
the substrate and inhibitor directly compete for the free active 
site. 

The model of activation and inhibition suggested by the data 
at 1 p m enzyme would predict that at higher enzyme concen¬ 
trations less activation by a compound would be observed at 
lower inhibitor concentrations, and the inhibition of activity 
would be detected at lower inhibitor concentrations because 
the equilibrium would be pushed toward dimer formation. In 
contrast, lower enzyme concentrations would result in 
higher activation by compounds, and inhibition by the com¬ 
pound would occur at significantly higher compound con¬ 
centrations. Therefore, we further measured the activity of 
MERS-CoV 3CL pro at two additional enzyme concentrations 
(0.5 and 2.0 pM) in the presence of varying concentrations of 
compound 10 . Remarkably, we observed that the activation 
effect was most pronounced at the lowest MERS-CoV 3CL pro 
concentration tested (0.5 pM), and the effect decreased as the 
enzyme concentration was increased (1.0 and 2.0 pM) (Fig. 
5 A). Moreover, inhibition by compound 10 occurred at 
lower compound concentrations when higher concentra¬ 
tions of enzyme were used. These observations further sup¬ 
port a model whereby enzyme activation can occur through 
ligand-induced dimerization. 

The activation and inhibition of MERS-CoV 3CL pro by com¬ 
pound 10 can be explained by a simple kinetic model depicted 
in Fig. 5 B. The MERS-CoV 3CL pro monomer exists in equilib¬ 
rium with the dimer, and their relative concentrations depend 
on the total enzyme concentration. In the absence of substrate 
or compound, the I< d value is 52 pM, and the equilibrium is 
represented by the gray spheres (blue box) in Fig. 5 B. The mono¬ 
mer is unable to hydrolyze the substrate and is therefore inac¬ 
tive. Binding of inhibitor (Fig. 5 B, green triangle) to the mono¬ 
mer results in monomer to dimer switch leading to the 
formation of a dimer that contains inhibitor bound in one of 
the active sites. Once the dimer is formed, the substrate binds in 
the second active site and catalysis takes place. Under high 
inhibitor concentrations, however, the inhibitor molecule 
directly competes with substrate for the free dimer active site, 
and inhibition of the enzymatic activity is observed as a result. 

We would also expect to observe induced dimerization and 
activation in the presence of the substrate. Indeed, the mono¬ 
mer-dimer kinetic studies performed in Fig. 2 were performed 
at a fixed concentration of substrate at 2 pM. In this experiment, 
the K d value for the MERS-CoV 3CL pro dimer was determined 
to be 7.8 p m, which is lower than the K d value determined in the 
absence of substrate using AUC, thereby supporting substrate- 
induced dimerization. Given the high K m value of 3CL pro for the 
peptide substrate (51-54), even higher substrate concentra¬ 
tions would be required to observe substrate activation in a plot 
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FIGURE 5. Activation of MERS-CoV 3CL pro via ligand-induced dimerization. A, enzymatic activity of 0.5,1.0, and 2.0 jam MERS-CoV 3CL pro was measured in 
the absence and presence of varying concentrations of compound 10. Substrate concentration was fixed at 2.0 jam. % activity, normalized to zero inhibitor 
enzymatic activity, was plotted as a function of increasing inhibitor concentrations. Error bars represent the standard deviation for triplicate data. Increase in 
enzymatic activity (highlighted in cyan-shaded box) is observed in the presence of low concentrations of compound 10. Inhibition of enzymatic activity is 
observed at higher inhibitor concentrations (highlighted in yellow-shaded box). B, kinetic model describing the equilibrium between different species of 
MERS-CoV 3CL pro that are formed in the absence ( blue box) and presence ( green box ) of a ligand is shown. Based on the AUC-calculated Rvalue of ~ 52 jam, 
MERS-CoV 3CL pro primarily exists as a monomer in solution in the absence of a ligand. Upon ligand binding (inhibitor / in our case) to the monomer, the 
monomer-dimer equilibrium shifts toward dimer formation. Next, under lower inhibitor concentrations ( cyan-shaded box), substrate (S) binds in the second 
active site and catalysis takes place. However, under higher inhibitor concentrations {yellow-shaded box), inhibitor directly competes with the substrate for the 
second active site, and inhibition of the enzymatic activity is observed. 


of catalytic activity versus substrate concentration. However, 
we are limited to use our FRET-based substrate only at low 
concentrations due to a significant inner filter effect at higher 
concentrations of substrate. Therefore, a compound that both 
mimics substrate and has higher binding affinity can act as a 
useful surrogate for the substrate, allowing the observation of 
ligand-induced dimerization and activation even at low sub¬ 
strate concentrations. 

X-ray Structure of MERS-CoV 3CL pro in Complex with Com¬ 
pound 6 —To gain atomic level detail and molecular insight into 
the mechanism for substrate-induced dimerization of MERS- 
CoV 3CL pro , we attempted to crystallize and determine the 
x-ray structures of the unliganded MERS-CoV 3CL pro mono¬ 
mer and the MERS-CoV 3CL pro covalently modified with com¬ 
pound 6. Unfortunately, we were unable to crystallize the unli¬ 
ganded MERS-CoV 3CL pro monomer after multiple attempts, 


but we were able to crystallize and determine the x-ray struc¬ 
ture of MERS-CoV 3CL pro in complex with compound 6 to a 
resolution of 1.6 A. The statistics for x-ray data collection, pro¬ 
cessing, and refinement are summarized in Table 4. The MERS- 
CoV 3CL pro and 6 complex crystallized as a biologically rele¬ 
vant, symmetrical dimer in space group C2 with one monomer 
in the asymmetric unit. Electron density for the entire protein 
was clearly visible and strong electron density ( F a — F c >4cr) 
was present for compound 6 within the active site (Fig. 6 A). 

MERS-CoV3CL pro Has a Smaller S 2 Pocket than SARS-CoV 
3CL pr " —The active site of MERS-CoV 3CL pro bound with com¬ 
pound 6 is shown in Fig. 6, A and B. Compound 6 is covalently 
bound to the active site cysteine (Cys-148) via a 1.8 A bond 
between the y-sulfur and the electrophilic j3-carbon of the 
Michael acceptor. The P' ,-ethyl ester carbonyl, which mimics 
the carbonyl of the scissile bond in a substrate, forms a hydro- 
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TABLE 4 

X-ray data collection and refinement statistics 



MERS-CoV 3CL pro -6 

MERS-CoV 3CL pro, ll 

Beamline 

LRL-CAT sector 31ID-D 

LS-CAT sector 21ID-G 

Data collection 



Wavelength (A) 

0.9793 

0.9786 

Resolution range (A) 

19.35-1.62 (1.68-1.62)“ 

50.00-2.10 (2.14-2.10)“ 

Protein monomers in asymmetric unit 

1 

4 

Space group 

Cl 

pi i 

Unit cell dimensions 



a, b, c (A) 

106.49, 57.31, 48.88 

63.44, 114.93, 92.34 

a, p, y O 

90, 112.78, 90 

90, 90.89, 90 

Total no. of reflections 

63,855 

816,216 

No. of unique reflections 

32,851 

76,865 

Multiplicity 

1.9 (1.9)“ 

2.2 (2.2)“ 

Completeness (%) 

95.0 (93.8)“ 

96.8 (93.8)“ 

Mean // al 

5.2 (1.3)“ 

11.17(1.83)“ 

tW (%)) 4 

8.3 (67.2)“ 

8.8 (58.6)“ 

Refinement 



Resolution range (A) 

19.35-1.62 

42.59-2.10 

No. of reflections in working set 

30824 

76623 

No. of reflections in test set 

2026 

2019 


17.8 

15.91 

K eeW 1 

21.7 

21.51 

No. of non-hydrogen atoms 



Protein/water 

2380/208 

9383/995 

r.m.s.d .; d bond lengths (A) 

0.007 

0.013 

r.m.s.d., bond angles (°) 

1.09 

1.35 

Ramachandran favored (%) 

99 

98 

Ramachandran outliers (%) 

0 

0 

Molprobity clash score 

3.3 

1.94 

Average B -factor (A 2 ) 

20.4 

33.1 

Protein 

19.8 

32.5 

Ligands 

16.6 

41.1 

Solvent 

27.7 

37.9 


a Values in parentheses are for highest resolution shell. 

4 Emerge = - (I(h))\^fM where I,(h) is the ith measurement and (1(h)) is the weighted mean of all measurements of 1(h). 

c ^work an d ^free = M|T(/z)J — I F(b) c \)/h \ F(h ) 0 | for reflections in the working and test sets, respectively. 
d r.m.s.d. is root mean square deviation. 


gen bond with the backbone NH of Gly-146 that forms part of 
the oxyanion hole (Fig. 65). Within the .S', subsite, the P\ -lactam 
carbonyl, which is a surrogate for the amide of P, -glutamine of 
substrates, participates in a hydrogen bonding interaction with 
the imidazole ring of His-166, and the T,-lactam NH forms a 
hydrogen bond with the carboxylate oxygen of Glu-169. The 
-P 2 -backbone amide NH forms a hydrogen bond with the side 
chain carbonyl of Gln-192 (Fig. 6 B). The ^-leucine side chain 
atoms of the inhibitor make hydrophobic contacts with the side 
chains of Met-168 and Leu-49 that line the S 2 subsite pocket. 
Moreover, compared with the equivalent residue Thr-25 in 
SARS-CoV 3CL pro , Met-25 in the S 2 pocket of MERS-CoV 
3CL pro is expected to reduce the size of the hydrophobic 
pocket, which is supported by our observed SAR described 
above. 

The smaller size of the S 2 pocket in MERS-CoV 3CL pro is also 
consistent with the preference for a smaller leucine residue at 
the P 2 position of cleavage sites instead of a bulkier phenylala¬ 
nine or methionine residue. Indeed, analysis of the preference 
for leucine or phenylalanine at the P 2 position for the 11 3CL pro 
cleavage sites within the polyprotein of MERS-CoV shows that 
none of the 11 cleavage sites contain a phenylalanine residue at 
this position (Fig. 6 Q. Leucine is the predominantly favored 
residue at this position followed by methionine. Analysis of the 
cleavage sites from SARS-CoV, HKU4-CoV, and HI<U5-CoV 
shows that none of the 11 cleavage sites from group 2c mem¬ 
bers (MERS-CoV, HKU4-CoV, and HI<U5-CoV) contain a phe¬ 
nylalanine residue at the P 2 position; however, the SARS-CoV 


nsp5 | nsp6 cleavage site contains a phenylalanine residue at 
this position. 

Other interactions are also observed to play a significant role 
in stabilizing the MERS-CoV 3CL pr °-compound 6 complex. 
The /L-carbonyl and P :i -NH participate in hydrogen bonding 
interactions with the backbone NH and carbonyl of Glu-169. 
The ^-serine side chain is within hydrogen bonding distance of 
the side chain carboxamide of Gln-195 and the backbone car¬ 
bonyl of Lys-191. 

X-ray Structure ofMERS-Co V3CL pro in Complex with a Non- 
covalent Inhibitor —We were also able to obtain diffraction 
quality crystals of MERS-CoV 3CL pro in complex with com¬ 
pound 11, which has an almost identical chemical structure as 
that of compound 10 (Fig. 6 D). We previously showed that 
compounds similar to 10 and 11 act as potent noncovalent 
inhibitors of 3CL pro from SARS-CoV (33). The x-ray structure 
of compound 11 bound to MERS-CoV 3CL pro was determined 
to a resolution of 2.1 A and the x-ray data collection, processing, 
and refinement statistics are summarized in Table 4. The 
MERS-CoV 3CL pro and 11 complex crystallized in space group 
P2 1 with two biologically relevant dimers in the asymmetric 
unit. The overall root mean square deviation between the C-a 
atoms of the four chains was less than 1 A, with the highest C-a 
root mean square deviation of 0.719 A between chains C and D. 
Strong electron density ( F a — F c >4cr) was present for com¬ 
pound 11 within all the four active sites of the two dimers (Fig. 
6 D). 
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FIGURE 6. X-ray crystal structure of MERS-CoV 3CL pro in complex with inhibitors. A, solvent-accessible surface ( gray-shaded surface) of MERS-CoV 3CL pro 
and compound 6 complex. Compound 6 is displayed in ball and stick model with atoms colored as follows: carbons (orange), nitrogens (blue), and oxygens (red). 
Electron density associated with compound 6 is shown as an F a - F c electron density difference map contoured to 3a(green mesh). Substrate binding pockets 
5 4 -S', are labeled, where asterisk indicates the electrophilic carbon of compound 6 that forms a C-S covalent bond with the active site cysteine Cys-148. B, 
MERS-CoV 3CL pro and compound 6 complex with the MERS-CoV 3CL pro backbone represented as a ribbon model and relevant amino acids that interact with 
compound 6 represented as ball and sticks. MERS-CoV 3CL pro carbon atoms are colored blue, and compound 6 carbon atoms are colored orange. Nitrogen 
atoms are colored blue, and oxygen atoms are colored red. Catalytic residues Cys-148 and His-41 are also shown. Hydrogen bonds are depicted as red dashed 
lines. C, sequence logos showing amino acid conservation for the 11 polyprotein cleavage sites of different 3CL pro enzymes (MERS-CoV, HKU5-CoV, HKU4-CoV, 
and SARS-CoV), generated using the WebLogo server (63). Residues P 2 -P', are shown. Height of each letter corresponds to the amino acid conservation at that 
position. D, solvent-accessible surface (gray-shaded surface) of MERS-CoV 3CL pro and compound 11 complex. Compound 11 is displayed in ball and stickmode). 
Electron density associated with compound 11 is shown as a 2 F a - F c electron density difference map contoured to 1.5cr (green mesh). Functional groups of 
compound 11 with their corresponding binding pockets are highlighted in yellow, green, and blue ellipses. Chemical structure of compound 11 is shown in the 
inset. E, interactions between MERS-CoV 3CL pro and compound 11 are illustrated. Catalytic residues Cys-148 and His-41 are also shown. Hydrogen bonds are 
depicted as red dashed lines. 


The binding orientation for compound 11 in the active site of 
MERS-CoV 3CL pro is similar to the binding orientation of 
related compounds in the active site of SARS-CoV 3CL pro (PDB 
code 4MDS). The benzotriazole group binds in the .S’, subsite; 
phenyl propionamidyl occupies the S^-Sj subsite, and the thio¬ 
phene group binds in the S 2 subsite. Compound 11 also forms 
two direct and one water-mediated hydrogen bond interactions 
with amino acids in the MERS-CoV 3CL pro active site (Fig. 6 E). 
The A/3 of the benzotriazole ring forms a hydrogen bond with 
the side chain e-nitrogen of conserved His-166, and the central 
acetamide oxygen forms a hydrogen bond with the backbone 


NH of conserved Glu-169. The NH of the phenyl propionami¬ 
dyl group interacts with backbone carbonyl oxygen of the cat¬ 
alytic His-41 residue through a water-mediated hydrogen 
bond, and the imidazole ring of His-41 engages with the phenyl 
ring of phenyl propionamidyl group through T-shaped it stack¬ 
ing. The phenyl ring also form hydrophobic contacts with 
Leu-49. 

Interactions at the 3CL pro Dimer Interface —Analysis of the 
MERS-CoV 3CL pro and 6 and MERS-CoV 3CL pro and 11 crys¬ 
tal structures reveals key differences between the dimer inter¬ 
face of MERS-CoV and SARS-CoV 3CL pro (PDB code 2ALV) 
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A. Monomer A 


Monomer B 



Domain I 


Domain III 


MERS-CoV 3CL pro 
HKU4-CoV 3CL pr0 
SARS-CoV 3CLP ro 


Domain 


B. 



K137 (B) 


C. 



FIGURE 7. Comparison of x-ray crystal structures of 3CL pro dimers from MERS-CoV, HKU4-CoV, and SARS-CoV. A, superposition of dimers of MERS-CoV 
3CL pro (pink color), HKU4-CoV 3CL pro (yellow color, PDB code 2YNB), and SARS-CoV 3CL p, ° (blue color, PDB code 2ALV). For SARS-CoV 3CL pro , residues Arg-4 and 
Ser-123 from monomer A, and residues Gin-127, Lys-137, Glu-290, and Met-298from monomer B are represented as spheres. B, for SARS-CoV 3CL pro , interac¬ 
tions between the side chain of Arg-4 from monomer A and Gln-127, Glu-290, and Lys-137 residues from monomer B are shown. The corresponding residues 
in MERS-CoV 3CL pro and FiKU4-CoV 3CL pro are Val-4 in monomer A and Glu-290 in monomer B, which do not interact at the dimer interface. C, for SARS-CoV 
3CL pro , Ser-123 from monomer A engages in hydrogen bonding with Arg-298 from monomer B across the dimer interface. The corresponding residue in 
monomer B of MERS-CoV 3CL pro and HKU4-CoV 3CL pro is Met-298, which does not participate in any interaction with Thr-126 from monomer A across the dimer 
interface. 


(Fig. 7) (30). Two arginine residues, Arg-4 and Arg-298 (Fig. 7, 
A-C), form some of the key interactions at the dimer interface 
of SARS-CoV 3CL pro , and mutation of either of these amino 
acids results in a drastic loss of dimerization in SARS-CoV 
3CL pro (36, 38). Interestingly, these two arginine residues 
(Arg-4 and Arg-298) are substituted in MERS-CoV 3CL pro by 
two hydrophobic residues (Val-4 and Met-298) that are unable 
to participate in the formation of hydrogen bonds or salt 
bridges. Therefore, we initially thought that the loss of these key 
interactions might simply explain the > 100,000-fold weaker 
dimerization observed for MERS-CoV 3CL pro compared with 
SARS-CoV 3CL pro . Surprisingly, however, structural analysis of 
the dimer interface from the available x-ray structure of HKU4- 
CoV 3CL pro (PDB code 2YNB; Fig. 7, B and C), and primary 
sequence alignment of 3CL pro from MERS-CoV, FIKU5-CoV, 
FIKU4-CoV and SARS-CoV (Fig. 8) revealed that Val-4 and 
Met-298 are conserved between all the |3-CoV 2c members 
studied here. Substantial differences between the ability of 
MERS-CoV 3CL pro and HI<U4/HKU5-CoV 3CL pro to 
dimerize, despite their high sequence identity, led us to the 
hypothesis that nonconserved residues between MERS-CoV 
and other J3 -CoV 2c members that are remote from the dimer 
interface may play a significant role in dimer formation. 

Analysis of Nonconserved Residues of MERS-CoV 3CL pro — 
Analysis of our current crystal structures does not reveal a 
clear mechanism for the monomer to dimer switch of MERS- 


CoV 3CL pro upon ligand binding. Therefore, we attempted 
to identify the nonconserved residues in MERS-CoV 3CL pro 
that might affect enzymatic activity due to their proximity to 
key residues involved in substrate binding and/or dimer 
formation. 

Based on a sequence alignment, MERS-CoV 3CL pro contains 
—24 nonconserved amino acids (pink arrows in Fig. 8). Upon 
analyzing the position of these amino acids in the crystal struc¬ 
ture, we observed that a remarkable number of these amino 
acids are present in the loop regions. Fig. 9 A illustrates the 
nonconserved residues present in the loop regions as gray 
(monomer A) and pink (monomer B) spheres. Interestingly, we 
also observed that there are hot spots in the protein structure 
where most of these amino acids are clustered. These hot spots 
include the N-terminal region, the active site region, the inter¬ 
domain loop (loop between the catalytic fold and domain III), 
and the domain III. In MERS-CoV 3CL pro , nonconserved 
amino acid His-8, which forms van der Waals contacts with 
Lys-155 of the same monomer and Thr-128 of the other mono¬ 
mer, is present at the end of the N-terminal finger (Fig. 9, B and 
C), whereas amino acids Asp-12 and Ala-15 are part of the 
N-terminal helix (Fig. 9 B). Additionally, amino acids Thr-128, 
Lys-155, and Ser-158 are present within 6 A of the N-terminal 
region (Fig. 9 B). Substitution to these amino acids in MERS- 
CoV 3CL pro might have changed the protein dynamics in a way 
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FIGURE 8 . Sequence alignment of 3CL pro enzymes from MERS-CoV, HKU5-CoV, HKU4-CoV, and SARS-CoV. Programs MultAlin (64) and ESPript (65) were 
used for the sequence alignment and visualization. Secondary structural elements of MERS-CoV 3CL pro are represented as spirals for a-helix, arrows for 
/3-strands, 17 for 3 10 helix, and Tfor (3-turns. Residues Val-4 and Met-298 in MERS-CoV, HKU5-CoV, FIKU4-CoV 3CL pro , and Arg-4 and Arg-298 in SARS-CoV are 
shown in a green box; catalytic residues His-41 and Cys-148are highlighted in a purple box. The nonconserved residues of MERS-CoV 3CL pro are marked With pink 
arrows. % identity with MERS-CoV 3CL pro is shown. 
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FIGURE 9. Analysis of the nonconserved amino acids of MERS-CoV 3CL pro . A, representation of MERS-CoV 3CL pro dimer with monomers A and B colored in 
orange and yellow, respectively. Nonconserved residues that are present in the loop regions are shown as spheres in gray and pink for monomers A and B, 
respectively. Other nonconserved residues are represented as spheres with the corresponding chain color. Domains l-lll and the inter-domain loop are labeled. 
Catalytic residues His-41 and Cys-148 are shown as green spheres. Inhibitor molecule is shown in both active sites in blue sticks. B-G, residues of monomer Bare 
shown (yellow and pink), unless otherwise labeled. B, clustering of some of the nonconserved amino acids, His-8, Asp-12, Ala-15, Thr-128, Lys-155, and Ser-158, 
near the N-terminal region is shown. N-terminal helices for both monomers are labeled. C, His-8 from the N-terminal region forms van der Waals contacts with 
Lys-155 of the same monomer and Thr-128 of the other monomer in the dimer. D, nonconserved residue Met-61 forms hydrophobic contacts with the Met-43 
residue, which is in close proximity to the catalytic residue His-41. E, loop containing the nonconserved residue Ala-171 forms the S, pocket along with residues 
His-166 and His-175. F, Val-132 forms hydrophobic contacts with a residue within the same domain (Ala-114), as well as Glu-290 from domain III. G, noncon¬ 
served residue Tyr-137 makes hydrophobic contacts with Tyr-185;Tyr-185 along with two other nonconserved residues Thr-183 and Met-189 are present on 
the inter-domain loop. 


that only ligand binding populates the monomer conformation, 
which is more amenable to dimer formation. 

We also observe that some of the nonconserved residues in 
MERS-CoV 3CL pro are located in proximity to the substrate¬ 
binding site and might contribute toward ligand-induced 
dynamic changes favorable for dimer formation. For example, 
nonconserved amino acid Met-61 forms hydrophobic interac¬ 
tions with Met-43, which in turn is in close proximity to the 
catalytic residue His-41 (Fig. 9 D). Residue Ala-171 is present on 
a loop, and this loop, along with conserved residues His-166 
and His-175, forms the .S', subsite for binding the P, amino acid 
of the substrate (Fig. 9 E). In addition to its influence on sub¬ 
strate binding, Ala-171 may also contribute toward dimer for¬ 
mation upon substrate binding due to its close proximity with 
Glu-169. This glutamate residue in SARS-CoV 3CF pro (Glu- 
166) has been established as a key residue linking the substrate¬ 


binding site to the dimer interface (56). Val-132 forms hydro- 
phobic interaction with other nonconserved residue Ala-114 
within domain II (Fig. 9 F). Additionally, Val-132 is present 
within van der Waals contact distance of Glul-290 from extra¬ 
helical domain III (Fig. 9 F). It is noteworthy that Glu-290 forms 
a salt bridge with Arg-4 across the dimer interface in SARS- 
CoV 3CF pro . However, this interaction is not formed in MERS- 
CoV 3CF pro due to the substitution of Arg-4 with Val-4. Tyr- 
137 forms hydrophobic contacts with the conserved residue 
Tyr-185 (Fig. 9G). 

Besides amino acid Val-132 that connects domains II and III, 
residue Tyr-185, along with two other nonconserved residues, 
Thr-183 and Met-189, is present on the inter-domain loop that 
connects the catalytic fold (domains I and II) with the extra-helical 
domain III (Fig. 9G). Flexibility within these residues might affect 
the orientation of domain III required for dimer formation. 
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FIGURE 10. Proposed model for polyprotein processing in MERS-CoV regulated by ligand-induced dimerization of MERS-CoV 3CL pro . MERS-CoV 3CL pro 
domains I and II are together represented as the rectangular box, and domain III is represented as a cylinder. The N and C termini are labeled, and the yellow 
cylinder labeled S represents a ligand that can be a peptide inhibitor, peptide substrate, or 3CL pro cleavage sites in the polyprotein. Various steps required for 
the auto-release of 3CL pro from the polyprotein and subsequent processing of the polyprotein cleavage sites are described in the text. Suggested by our AUC 
and kinetic studies, the shaded region (steps 5 and 6) high lights the additional steps MERS-CoV 3CL pro would undertake during polyprotein processing and have 
been described in the kinetic model depicted in Fig. 56. 


Discussion 

Modelfor Regulation of the Enzymatic Activity ofMERS-Co V 
3CL pro during Polyprotein Processing —Enzymatic activity of 
coronavirus 3CL pro is required for the processing of viral poly¬ 
proteins at 11 distinct cleavage sites, allowing the release of 
nonstructural proteins that subsequently form a replication 
complex for virus genome replication. Because of its indispens¬ 
able role in the virus life cycle, regulation of the enzymatic activ¬ 
ity of 3CL pro is instrumental for efficient replication of corona- 
viruses. Based on our experimental results, we propose a model 
to explain the mechanism for regulating the enzymatic activity 
of MERS-CoV 3CL pro in the context of polyprotein processing 
during virus infection (Fig. 10). 

A number of in vitro studies performed on SARS-CoV 
3CL pro have established the mechanism for 3CL pro auto-release 
from the polyprotein (34, 39, 40). Based upon these studies and 


our data on MERS-CoV 3CL pro , we propose the polyprotein 
processing model in Fig. 10. The steps proposed for auto-re¬ 
lease of MERS-CoV 3CL pro from the polyprotein ( steps 1-4, 
Fig. 10) have been adapted from Chen et al. (39), where it is 
suggested that the N-terminal auto-processing does not require 
the formation of a mature 3CL pro dimer for SARS-CoV. Based 
on the differences between the properties of SARS-CoV 3CL pro 
and MERS-CoV 3CL pro , as highlighted in our studies, we added 
two additional steps ( steps 5 and 6, Fig. 10) that MERS-CoV 
3CL pro may need to utilize for efficient polyprotein processing. 
In Fig. 10, step 1, two immature MERS-CoV 3CL pro monomers 
in the polyprotein approach each other and form an immature 
dimer via interactions between domain III, which allows each of 
the monomers to insert their N termini into the active site of the 
other monomer. In step 2, the N termini are cleaved, and the 
dimer with uncleaved C termini adopts a conformation similar 
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to the mature dimer. Our observation of auto-cleavage of the 
N-terminal His 6 tag from MERS-CoV 3CL pro during expres¬ 
sion in bacterial cells supports steps 1 and 2, where formation of 
an immature dimer capable of auto-processing the N terminus 
occurs. In step 3, two dimers with uncleaved C termini 
approach each other, followed by insertion of the C terminus 
from one dimer into one of the active sites of the other dimer. In 
step 4, the C termini are cleaved and mature dimer is released 
from the polyprotein. 

For SARS-CoV, the 3CL pro dimer formed in step 4 continues 
to process cleavage sites in the polyprotein, effectively skipping 
steps 5 and 6 ( red arrow in Fig. 10) because the dimer is tightly 
associated. However, the high K d value of MERS-CoV 3CL pro 
dimer suggests that the active and mature dimer may dissociate 
into inactive, mature monomers in the absence of any ligand 
(step 5). In order for polyprotein processing to proceed, another 
step (step 6) must occur. In step 6, a substrate S, e.g. one of the 11 
polyprotein cleavage sites, would induce dimer formation and 
hence activate catalysis and cleavage at the substrate recogni¬ 
tion sites. Our AUC results and the kinetic activation studies 
performed in the absence and presence of inhibitors support 
steps 5 and 6 where the inactive but mature monomers require 
binding of a ligand to undergo ligand-induced dimerization and 
formation of an active, mature dimer that can then process the 
polyprotein cleavage sites. 

Nonconserved Amino Acids of MERS-CoV3CL pro May Regu¬ 
late the Dimer Formation —Long range interactions have been 
reported to modulate dimerization and activity of 3CL pro 
enzymes. Barrila et al. (57) demonstrated that mutation of a 
conserved amino acid Ser-147, which is distant from the dimer 
interface, results in a total loss of dimerization and enzymatic 
activity of SARS-CoV 3CL pro . Although Ser-147 does not form 
direct interactions at the dimer interface, disruption of the 
dimer upon mutation stems from the fact that Ser-147 makes 
several interactions with other residues involved in forming a 
hydrogen bonding network within SARS-CoV 3CL pro . Site-di¬ 
rected mutagenesis studies on domain III of SARS-CoV 3CL pro , 
where N214A and S284A/T285A/I286A mutants were charac¬ 
terized, revealed that despite being present on an entirely dif¬ 
ferent domain, these residues affect catalysis through a network 
of residues undergoing correlated motions across the entire 
protease (58, 59). Utilizing 3CL pro temperature-sensitive 
mutants of MHV, Stobart et al. (60) have also demonstrated 
that second-site mutation physically distant from the tempera¬ 
ture-sensitive mutation suppresses the temperature-sensitive 
phenotype through long range interactions, thereby regulating 
3CL pro enzymatic activity during polyprotein processing and 
virus replication. 

Our studies also suggest that long range interactions among 
the nonconserved residues can significantly alter the properties 
of MERS-CoV 3CL pro . A detailed analysis of nonconserved res¬ 
idues of MERS-CoV 3CL pro among /3-CoV 2c members identi¬ 
fied hot spots, including the N-terminal finger and helix, the 
active site region, the inter-domain loop, and the domain III, 
where these residues are clustered. Several studies done on 
SARS-CoV 3CL pro have demonstrated that amino acids from 
the N-terminal finger, the N-terminal helix, and domain III 
significantly contribute toward dimer formation. 


In addition to the direct interactions at the dimer interface, 
correct orientation between the catalytic fold and domain III is 
also crucial for dimer formation. Wu et al. (61) showed that the 
mostdramaticdifferencebetweenthecrystalstructuresofmono- 
mer and the ligand-bound dimer of the R298A mutant of SARS- 
CoV 3CL pro was a 33° rotation of domain III (38). This rotation 
results in a steric clash between domain III from two monomers 
and would essentially block dimer formation. However, upon 
addition of a ligand, domain III of the R298A mutant adopts the 
correct orientation and results in the formation of a dimer 
structure. Similar to the SARS-CoV 3CL pro R298A mutant, 
ligand binding into the active site of the MERS-CoV 3CL pro 
monomer possibly stabilizes the inter-domain loop conforma¬ 
tion that maintains domain III in the correct orientation for 
dimer formation. Most of the nonconserved residues within 
domain III are present on the surface and also are distant from 
the dimer interface. These residues may be involved in provid¬ 
ing the flexibility required for conformational changes during 
the monomer to dimer switch. 

We have identified several amino acids in MERS-CoV 3CL pro 
that may contribute to the dimer formation upon ligand bind¬ 
ing. However, single amino acid mutagenesis alone is unlikely 
to reveal significant differences in the dimerization properties. 
As demonstrated by Myers et al. (62) for ornithine decarboxy¬ 
lase, the response of single amino acid to ligand binding may be 
limited to only local conformational changes and may not have 
significant contribution toward dimer stability. However, local 
conformational changes in a network of residues may propa¬ 
gate larger effects that stabilize dimer formation upon ligand 
binding. Analysis of the nonconserved residues of MERS-CoV 
3CL pro discussed here sets forth a framework to perform sys¬ 
tematic single or multiple mutagenesis studies to gain insights 
into the mechanism for ligand-induced dimerization of the 
enzyme. 

Development of 3CL pro Inhibitors with Broad Spectrum 
Specificity— Insights into the mechanistic and structural simi¬ 
larities as well as differences between 3CL pro enzymes from 
different coronavirus subgroups are instrumental for the devel¬ 
opment of 3CL pro inhibitors with broad spectrum specificity. 
To evaluate the broad spectrum specificity of our peptidomi- 
metic compounds, we determined their inhibitory activity 
against 3CL pro from MERS-CoV, SARS-CoV, HKU5-CoV, and 
HI<U4-CoV. Our inhibitory data and Revalues clearly show that 
compounds 6-9 inhibit all the 3CL pro enzymes tested here. 
The x-ray structure of MERS-CoV 3CL pro in complex with 
compound 6 revealed that out of eight direct hydrogen bonds 
formed between compound 6 and MERS-CoV 3CL pro , four of 
these hydrogen bonds involve interactions with conserved 
structural elements of the peptide backbone of the enzyme. 
Furthermore, the amino acids that form hydrogen bonds with 
compound 6 through side chain interactions are conserved in 
all the coronavirus 3CL pro enzymes evaluated here, as well as 
3CL pro enzymes from other j3-coronaviruses like MHV, OC43, 
and HKU1. These results suggest that canonical structural fea¬ 
tures exist among the 3CL pro enzymes that can be exploited for 
structure-based design of broad spectrum inhibitors. 

For the noncovalent inhibitor compound 11, the x-ray struc¬ 
ture reveals two direct hydrogen bonding interactions between 
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the compound and MERS-CoV 3CL pro . One of the hydrogen 
bonds forms with the side chain e-nitrogen of conserved His- 
166, and the second involves the backbone NH of conserved 
Glu-169. We speculate these interactions remain conserved in 
other 3CL pro enzymes as well, because His-166 and Glu-169 
amino acids are conserved in all 3CL pro enzymes. In fact, the 
crystal structure of SARS-CoV 3CL pro in complex with an 
inhibitor similar to compound 11 (PDB code 4MDS) reveals 
that the interactions of the inhibitor with the amino acids His- 
166 and Glu-169 are conserved. 

The identification of 3CL pro -inhibitor interactions utilizing 
conserved elements of the protein structure, including the pep¬ 
tide backbone and conserved side chains of active site residues, 
suggests that the development of broad-spectrum inhibitors of 
coronavirus 3CL pro is feasible. 

Our studies here demonstrate the unique properties of 
MERS-CoV 3CL pro among j3-CoV 2c members, evident from 
the requirement for a ligand to induce dimerization. Although 
the peptidomimetic compounds containing a Michael acceptor 
group (for example, compounds 6 -9) induce dimer formation 
of MERS-CoV 3CL pro , the irreversible nature of their reaction 
with the active site cysteine ensures complete inhibition of the 
enzyme at stoichiometric ratios in a time-dependent manner. 
On the contrary, noncovalent peptidomimetic compounds (for 
example, compounds 10 and 11) inhibit the enzymatic activity 
of MERS-CoV 3CL pro only at high compound concentrations. 
Based on these observations, compounds that irreversibly mod¬ 
ify the 3CL pro active site may serve as better candidates for the 
development of inhibitors for MERS-CoV 3CL pro . 

Potential Complexity in the Development of MERS-CoV 
3 CL pro Inhibitors as Antiviral Agents —Induced dimerization of 
MERS-CoV 3CL pro , as seen in the presence of peptidomimetic 
inhibitors, has significant implications in the development of 
antiviral agents targeting MERS-CoV 3CL pro . As a consequence 
of enzyme activation, the development of an effective antiviral 
agent may necessitate the development of a compound that can 
inhibit the MERS-CoV 3CL pro monomer and stabilize it with¬ 
out inducing dimerization and/or inhibit the active sites of the 
dimer at low doses, ensuring inactivation of both the active sites 
within the dimer. On the contrary, it is also possible that the 
presence of an inhibitor could enhance the activity of MERS- 
CoV 3CL pro to an extent that results in a complete loss of the 
temporal and spatial regulation of the enzymatic activity, 
thereby disrupting viral genome replication. Ramifications of 
ligand-induced dimerization and activation of MERS-CoV 
3CL pro , as seen in the presence of lower concentrations of 
inhibitor, will need to be further explored in virus-infected 
cells. 
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